A Comparison of Similarity Techniques for Detecting Source Code Plagiarism
Abstract
Academic dishonesty is a universal problem. Detecting duplicated text among natural language artifacts is a well-documented task. However, performing similar analysis on source code presents unique problems. In this paper, I compare various textual similarity techniques applied to source code. Beyond this, I investigate the application of textual similarity algorithms to the Abstract Syntax Tree (AST), Control Flow Graph (CFG), and Intermediate Representation (IR) produced by the LLVM compiler toolchain. Aggregate similarity scores of a variety of methods compare favorably against the current state-of-the-art source code plagiarism tool.
Similar resources
COAT: Code ObfuscAtion Tool to evaluate the performance of code plagiarism detection tools
There exist many plagiarism detection tools to uncover plagiarized codes by analyzing the similarity of source codes. To measure how reliable those plagiarism detection tools are, we developed a tool named Code ObfuscAtion Tool (COAT) that takes a program source code as input and produces another source code that is exactly equivalent to the input source code in their functional behaviors but w...
Detecting Functional Similarity between Java Files Using Metrics
Two similar pieces of software code are called clones. Software developers often copy a section of code and then paste it with or without modification. Software clone detection is employed to lessen the software maintenance cost and to improve understandability of the system. It also helps in plagiarism detection. Many code clone detection techniques exist and they detect and identify various t...
An Abstract Method Linearization for Detecting Source Code Plagiarism in Object-Oriented Environment
Despite the fact that plagiarizing source code is a trivial task for most CS students, detecting such unethical behavior requires a considerable amount of effort. Thus, several plagiarism detection systems were developed to handle this issue. This paper extends Karnalim’s work, a low-level approach for detecting Java source code plagiarism, by incorporating abstract method linearization. Such e...
A comparison of plagiarism detection tools
In this paper we compare five tools for detecting plagiarism in source code texts: JPlag, Marble, moss, Plaggie, and sim. The tools are compared with respect to their features and performance. For the performance comparison we carried out two experiments: to compare the sensitivity of the tools for different plagiarism techniques we have applied the tools to a set of intentionally plagiarised p...
A Source Code Similarity System for Plagiarism Detection
Source code plagiarism is an easy to do task, but very difficult to detect without proper tool support. Various source code similarity detection systems have been developed to help detect source code plagiarism. Those systems need to recognize a number of lexical and structural source code modifications. For example, by some structural modifications (e.g. modification of control structures, mod...
Publication date: 2014